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ABSTRACT 


A  test  of  fit  based  on  minimum  chi-square  techniques  is  developed 
for  continuous  distributions.  This  procedure  is  investigated  in  detail  for  the 
special  case  of  testing  for  normality,  where  the  test  statistic  is  based  on 
the  first  four  sample  moments.  The  asymptotic  non-null  distribution  of  the 
general  test  statistic  is  obtained,  and  in  particular  the  power  of  the  test 
of  normality  is  derived  for  several  alternative  families  of  distributions. 


A  TEST  OF  FIT  FOR  CONTINUOUS  DISTRIBUTIONS  BASED  ON 
GENERALIZED  MINIMUM  CHI-SQUARE 

]ohn  Gurland  and  Ram  C.  Dahiya 

1.  Introduction 

In  this  paper  a  test  of  fit  for  continuous  distributions  is  devr  .oped  based 

on  generalized  minimum  chi-square  techniques.  Although  the  Pearson  chi- 

square  test  of  fit  is  widely  used  especially  in  the  case  of  discrete  distributions, 

there  are  difficulties  in  applying  it,  particularly  in  the  case  of  continuous 

distributions .  A  discussion  of  these  difficulties  is  included  in  the  paper  by  Dahiya 

and  Gurland  (1970a).  A  motivation  for  obtaining  the  results  in  the  present  paper  i-  to 

develop  a  test  which  is  free  of  complications  associated  with  the  Pearson  chi-square 

test.  In  particular  the  question  of  how  to  form  class  intervals  does  not  arise  in  the  test 

2 

of  fit  presented  here.  Furthermore  the  asymptotic  dis  tribution  is  exactly  that  of  a  x  , 

in  contradistinction  to  the  asymptotic  distribution  of  the  statistic  employed  in  the 
_  2 

Pearson  x  test  when  the  estimators  of  parameters  are  obtained  from  the  ungrouped 
sample  (of.  Chernoff  and  Lehmann  (1954)). 

The  asymptotic  non-null  distribution  of  the  test  statistic  proposed  here 
is  developed  for  general  alternatives.  As  a  special  case  the  asymptotic  power 
is  obtained  for  testing  normality  against  s>*  *er  •  specific  oite-  n^.lvo  fn»;n.u-;  .• 

distributions.  The  power  of  this  test  is  compared  with  that  of  a  modified  form  of 
the  Pearson  chi-square  'est  based  on  random  intervals  presented  by  Dahiya 
and  Gurland  (1970a,  1970b). 

This  work  was  supported  in  oart  by  the  National  Science  Foundation,  the 
Wisconsin  Alumni  Research  Foundation,  and  the  United  States  Army  under 
Contract  No.:  DA-31-124-ARO-D-46  2. 


Although  the  test  of  fit  presented  here  is  for  continuous  distributions, 
the  method  based  on  minimum  chi-square  techniques  is  quite  general  and  can 
in  fact  be  adapted  to  discrete  distributions.  Hinz  and  Gurland  (1970)  have 
applied  such  techniques  to  develop  a  test  of  fit  for  the  negative  binomial 
and  other  contagious  distributions. 

2.  Formulation  of  a  test  statistic  based  on  sample  moments 

First  we  consider  the  problem  in  a  general  context  and  show  how  to 
construct  a  statistic  for  testing  the  fit  of  a  hypothesized  distribution  based  on 
a  set  of  sample  moments.  In  a  subsequent  section  the  result  obtained  here 
will  be  applied  to  develop  a  test  of  normality. 

Let  Xp  X2,  .  . . ,  X^  be  a  random  sample  from  a  certain  distribution 
with  p.  d.  f. 

Px(x  |  9)  (2.1) 

where  0  is  a  parameter  vector  of  q  components,  that  is 


0’  =  [  0p  0p  . . . ,  0  ] 


th 

Denote  the  j  raw  sample  moment  by 


n 


m 


,.iy  x) 

>~ni%  1 


Let 


m'  =  [mj,  m^,  ...,  m|J 


(2.2) 


(2.3) 


(2.4) 
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where  s,  (s  >  q),  is  a  fixed  number  that  remains  to  be  specified.  (A  low 

value  of  s  is  generally  desirable  due  to  the  large  sampling  fluctuations  of 

th 

higher  order  moments. )  Under  the  assumption  that  the  (2s)  order  moment  of 
X  exists,  we  can  easily  show  by  making  use  of  the  Central  Limit  Theorem  that 
the  asymptotic  distribution  of  '/n(m-p)  is  normal, 

N(0;  G)  (2.5) 

where  the  vector  ^  is  the  population  counterpart  of  m,  given  by 

p'  =  [  pj,  P^j  •  •  • ,  Pg]  (2.  6) 

and  the  s  X  s  covariance  matrix  G  is  given  by 


(2.7) 


(2.  8) 
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is  given  by 


[  N(0;  2)  (2.11) 

i 

i 

where 


h'  -  [ h^,  •••>  hg] 
4'  =  [4p  42,  ...»  4g] 
S  =  JGJ* 


94 


L 


and  J  is  the  sXs  Jacobian  matrix  (-r-f)  • 


From  this  result  it  follows  that  the  asymptotic  distribution  of 


(2.12) 


Q  =  n(h  -  4V  S_1(h-U 


(2.13) 


2  * 

is  that  of  x  •  Furthermore,  if  2  is  a  consistent  estimator  of  2  ,  which  is 
s 

obtained  from  2  on  replacing  parameters  by  maximum  likelihood  or  some 
other  consistent  estimators,  then  according  to  Gurland  (1948),  Barankin  and 
Gurland  (1951),  the  asymptotic  distribution  of 

Q*  =  n(h  -  4)'  £_1(h  -  4)  (2.14) 


is  the  same  as  the  asymptotic  distribution  of  0. 

Now  suppose  we  select  functions  h^  such  that  ^  are  linear  functions 
of  the  parameters  6^,  0^,  . . . ,  8^,  that  is 

4  =  W6  (2.15) 


-4- 
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where  W  is  a  sXq  matrix  of  known  constants.  In  such  a  case  we  can 

* 

find  an  estimator  for  6  by  minimizing  the  expression  for  Q  in  (2.14) 

A 

This  estimator,  0 ,  say,  is  given  by 

0  S  (W' £-1wf  Sv  s"1  h.  (2.16) 


Let 


i  =  we 


and 


Q  =  n(h  -  E)'2-1(h  -  l)  . 


Now  let 


Then 


R  =  W(W'  Z_1W)”1W' 

*  A  _  1  A 

A  =  2  (I  -  R)  . 


0  =  n(h  -  Rh)'£-1(h  -  Rh)  =  nh'(I  -  R)£_1(I  -  R)h 


=  nh'Ah  . 


(2.17) 


{  '  18) 


(2.19) 


(2.  20) 


From  results  of  Gurland  (1948),  Barankin  and  Gurland  (1951),  the  asymptotic 

A 

distribution  of  nh'Ah  is  the  same  as  the  asymptotic  distribution  of  nh'Ah 

A  A 

where  A  is  obtained  from  A  on  replacing  E  by  2  .  In  order  to  find  the 
distribution  of  nh'Ah  we  make  use  of  the  following  lemma. 
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Lemma  1. 


If  X  is  distributed  as  N(|jl;  2)  and  B  is  a  matrix  such  the  ZB  is 

idempotent  then  the  distribution  of  X'BX  is  non-central  chi-square  with  r 

2 

degrees  of  freedom  and  noncentrality  parameter  X,  denoted  by  ^  , 
where  r  is  the  rank(B)  and  X.  = 

Proof: 

Let  P  be  a  nonsingular  matrix  such  that 

P2P1- I,  (2.21) 

an  identity  matrix.  On  making  use  of  the  transformation 

Y  =  PX  (2.  22) 

it  follows  that  Y  is  distributed  as  N(P}i*,  I)  and 

X'BX  =  Y'P'"1BP“iY  .  (2.23) 

Now  P'  *BP  *  is  an  idempotent  matrix  of  rank  r  since  ZB  =  P“*P'  *B 

2 

is  idempotent  of  rank  r.  Hence  the  distribution  of  X'BX  is  ^  ^  where 

X  =  (PH.)'P'"1BP"1(P^)  =  h'B(jl  (2.  24) 


This  proves  Lemma  1. 

From  the  above  lemma  and  also  assuming  W  of  full  rank  q  ,  we  see 

that  the  asymptotic  null  distribution  of  nh'Ah  is  since  ZA  is  an 

s-q 

idempotent  matrix  of  rank  s-q  and  £'A£  =0  which  can  easily  be  verified. 

*  2 

Thus  it  follows  that  the  asymptotic  distribution  of  Q  is  that  of  v 

As-q  ’ 
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The  statistic  Q  can  be  utilized  for  testing  the  fit  of  an  assumed 
distribution.  In  order  to  ascertain  how  well  such  a  test  of  fit  behaves,  its 
power  against  specific  alternatives  can  be  obtained  from  the  non-null 
distribution  given  in  section  3. 

3.  Asymptotic  non-null  distribution  of  Q 

A 

The  asymptotic  non-null  distribution  of  Q  turns  out  to  be  that  of  a 
weighted  sum  of  independent  non-central  x  random  variables  each  with  one 
degree  of  freedom.  A  derivation  of  this  result  along  with  the  precise  weights  and 
non-centralities  is  given  in  the  following  theorem. 


Theorem  1. 

Let  the  null  and  alternative  hypotheses  HQ,  respectively  be  as 
follows : 


H  :  X  has  p.d.f.  p  (x  |  0) 
G  X 


-oo  <X  <<»,  6'  =  [9j,  92,  •••,  9q] 


(3.1) 


Hj*.  X  has  p.d.f.  p^\x  |  y) 

-oo  <  x  <  oo  ,  y’  =  [  Yj,  Y2,  . . . ,  yp] 


(3.2) 


where  p  <  q.  Here  0  and  y  are  parameter  vectors. 

a 

Then  the  asymptotic  non-null  distribution  of  Q,  defined  in  (2.20), 
is  of  the  form 


1 


iXl,  a 


i 


(3.  3) 
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The  constants  ^  are  given  by  (3.7)  and  a^  by  (3.8),  (3.10). 

Proof: 

A 

Let  us  denote  the  matrix  to  which  Z  converges  under  in 

❖ 

probability  by  E  ,  that  is 

Z  ~  Z*  under  Hj  .  (3.4) 

* 

Then  Z  involves  the  parameter  vector  y  Now  the  asymptotic 

A 

non-null  distribution  of  nh'Ah  is  the  same  as  that  of 

(1)  * 

O'  1  =  nh'A  h  (3.5) 

*  *  A  *  (1) 
where  A  is  obtained  from  A  on  replacing  Z  by  E  .  Let  Z  denote 

the  asymptotic  ocvariance  matrix  of  ^ nh  under  which  can  be  found 
in  the  same  way  as  Z  is  found  under  H0  .  Also  if  t/  ^  denotes  the 
population  counterpart  of  h  under  H^,  then  the  asymptotic  non-null 
distribution  of  Vn(h-C^)  is  that  of  N(0;  Z^)  .  There  exists  a  non¬ 
singular  matrix  T  and  an  orthogonal  matrix  P  such  that 

TZ(1)T'  =1  (3.6) 

and 


3- 
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o 


-1  *  “1 

PT'  A  T  P'  =  D  = 


s-q 


(3.7) 


where  I  is  the  identity  matrix  and  D  is  a  diagonal  such  that  the  last  q 

* 

diagonal  elements  of  D  are  zero.  This  is  possible  since  rank(A  )  =  s  -  q. 
Let 


and 


u  =  PTh 

> 


V 


(3.8) 


Then  we  have 


nh'A*h  =  n(T”1P'u)'A*(T~1P'u)  =  nu'Du 


s-q 

Yj  d  Wnu  )  . 
1 


(3.9) 


Now  since  the  asymptotic  distribution  of  \l n(h  -  t^)  is  N(0*,  f 

it  follows  that  the  asymptotic  distribution  of  */n(u  -  *)  is  N(Oj  I). 

* 

Hence  the  asymptotic  distribution  of  nh'A  h  is  that  of 
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k 


i 


where 


»  =  VrT  ^  1=1,  . . . ,  s-c 


(3.10) 


and  4^  is  the  element  of  This  proves  the  theorem. 

A 

4.  Test  of  fit  for  normal  distribution  based  on  Q 

A 

We  shall  now  consider  a  test  of  fit  based  or.  Q  when  the  null 
distribution  is  normal,  that  is,  X  has  p.d.f. 

(x-Oj)2 


l  2e? 

px(x  1  0)  =7^7* 


-oo  <  x  <  ®  ,  -»  <  0  <  eo  f  0  >  0  . 


(4.1) 


Let  m...  m„  and  m.  be  second,  third  and  fourth  central  sample 
c  3  4 

moments  respectively.  The  statistics  b2  given  by 

3/2  2 

bj  =  m3/m2  ,  b  =  m4/m2  (4.2) 

2 

are  sometimes  employed  for  testing  normality  by  means  of  skewness  and 
kurtosis.  Instead  of  considering  these  two  statistics  separately  it 
appears  more  rational  to  formulate  a  single  statistic  involving  the  first 
four  moments.  This  motivates  our  selection  of  functions  h^  based  on  the 
first  four  sample  moments.  The  mean,  variance,  third  and  fourth  central 
moments  of  X  are  respectively  given  by: 


-10- 
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u!  =  0.,  |JL  _  =  0_,  a  =0.  ll  „  =  30. 
lie  2  3  *  4  i 


(4.3) 


If  we  define 


e2  =  109  6 2 


V  =[^;,  log  n2,  m-3»  log(-j)] 


(4.4) 


then  the  elements  of  t,  are  linear  functions  of  the  parameters  0^  and  0 
We  can  now  write 


&  =W0 


(4.5) 


The  corresponding  hj  functions  are  given  by 


\  ~  mlJ  h2  =  log  m2*  h3  *  m3;  h4  =  log(  3  ) 


(4.6) 


where  mj  is  the  sample  mean  and  m^,  m^,  m^  denote  second,  third  and 

fourth  central  sample  moments,  respectively,  as  previously  indicated. 

The  transformation  from  sample  raw  moments  to  functions  h^  is 
achieved  in  two  stages,  that  is,  from  f  mj,  m'^,  m'^,  m^]  to 
(ml,  m  ,  m  ,  m  l  and  then  finally  to  (h  ,  h  ,  h  ,  h  ].  In  the  notations 

4  t  j  1  t  j  t 

of  section  3,  ^n(h  -  4)  is  asymptotic  N(0;  21  )f  where 
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s  =  1,1,0;;;. 


(4.7) 


Now  let 

£  =  (E)b  (4.11) 

e,  -  m2 

where  is  the  maximum  likelihood  estimator  of  6^.  Then  a  statistic 
* 

0  for  testing  normality  is  given  by 


Q  =  nh'Ah 


(4.12) 


-12- 
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where 


A  A  •}  A 

A  =  2  (I  -  R) 


R  =  WfW'Z'VfV'f”1 


(4.13) 


After  simplification  we  can  show  that 


A  = 


1.5 


-.  75 


0  l/(6m2)  0 


-.75  0 


.  375 


(4.14) 


Hence  a  simplified  form  of  Q  is  given  by 


0  =  nu’Bu 


(4.15) 


where 


m4 

u'  =^h2’  h3»  h4^  =f1°9<rn2)»  m3»  lo9(  3)] 


(4.16) 


and 


B  = 


1.5 

0 

-.75 


i/(6m2) 


-.  75 
0 

.  375 


(4.17) 


The  statistic  Q  in  (4.15)  can  easily  be  computed  on  a  desk  calculator. 
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*  2 

The  asymptotic  distribution  of  Q  is  x2  since  here  s  =  4  and 

q  =  2  in  the  notations  of  section  2  .  Thus  to  carry  out  a  test  of  fit  for 

normality  at  a  particular  level  of  significance,  one  merely  requires  the 

2 

corresponding  critical  point  of  the  x2  distribution. 


5.  Power  of  the  test  of  normality 

Let  p^(x|y),  where  y'  =  [  Yj>  y2i  •••»  Yp]  is  a  parameter  vector, 
denote  a  general  alternative  to  the  null  hypothesis  of  normality.  If  we  denote 
its  i**1  raw  moment  by  and  the  corresponding  central  moment  by 

then  the  asymptotic  non-null  distribution  of  */n(h  -  is  N(0,  E^), 

where 


(1) 


.(!)'  (1  )•  (1)  (1)  .  £4., 

4  *  iPj  ,  log  n2  ,  p3  ,  log  (—  )] 


.(1)  n(l)T(l)_(l).I(l)I(l).l 

*  -  (I2  h  ^  (J2  J1  ] 


i,  J  =  1,  2,  3,  4 


(5.1) 


2,1 

*3,1 


0 

1 


(5.2) 


with 
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*v*-< 


v«- ^ 

n.j'611!1’’  ;  *4,j*  -*!1’’ 


"l  o  0  0 

0  l/ji^  0  0 

0  0  10 


ooo  iU{j] 

4  - 


(5.  3) 


£  and  £^  are  the  asymptotic  covariance  matrices  of  */nh  under 

Hq  and  Hj  respectively  and  it  gives  an  insight  about  the  test  on  examining 

how  the  structure  of  these  two  matrices  differs  depending  on  the  distribution 

* 

assumed.  In  fact  the  sensitivity  of  the  test  based  on  0  will  depend  on  the 
difference  in  the  structure  of  these  two  matrices. 

Now  we  shall  prove  an  attractive  invariant  property  of  the  test  based 
on  O  defined  by  (4. 15). 


Theorem  2. 

* 

The  power  of  the  test  based  on  0,  defined  by  (4.15)  is  invariant 
with  respect  to  the  location  and  scale  parameters  of  the  alternative  distribution. 
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Proof; 


p  (1) 

Since  m2  —  under  Hj»  the  asymptotic  non-null  distribution 

A  * 

of  Q  =  nu'B  u  is  the  same  as  the  asymptotic  non-null  distribution  of 


Q(1)  =  nu'B(1)u 


(5.4) 


where 


1.50  0  -.75 

0  l/(6p(2l))3  0 

-.75  0  .375 


(5.  5) 


The  distribution  of  u  is  invariant  with  respect  to  the  location 
parameter  and  B^  does  not  involve  this  parameter,  hence  it  follows  that 
the  asymptotic  non-null  distribution  of  does  not  involve  the  location 
parameter. 

Now  let  p  be  the  scale  parameter  in  the  alternative  distribution 
of  X.  If  we  take 


(5.6) 


then  the  distribution  of  Y  does  not  involve  p. 
Let  V'  =  [Vj,  V2,  V3)  be  such  that 


Vj  =  Uj  -  2  log  p  =  log  m2(y) 


V2=u2/P' 


=  m3(y) 


V3  =  u3  -  4  log  p  =  log  m4(y)  -  log  3 


(5.7) 
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where 


m^y)  =  i  =  2,  3,  4  . 


(5.8) 


Then  the  distribution  of  V  does  not  involve  the  parameter  p  since 

the  distributions  of  m.(y).  m.(y)  and  m.(y)  do  not  Involve  this  parameter. 

c  3  4 

If  p2(Y)  be  variance  of  Y  then  we  have 

li”'  =  ti2(Y)p2  (5.9) 

and 


Q(1)  .  u'B(1)u  «  [  Vj  +  2  log  p,  V2p3,  V3  +  4  log  p]B 


(1) 


Vj  +  2  log  p 

V3 

lV3  *  4  log  p| 


(5.10) 


=  V'B  V 


where 


B  ■ 


1.50  0  -.75 

0  1/(6^3<Y))  0 

-.75  0  .  375^ 


u 


It  is  surprising  that  although  the  scale  parameter  p 
and  B^,  it  cancels  out  in  u'B^u  as  is  evident  in  (5. 


is  involved  in 
10). 
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r 


* 

Since  the  distribution  of  V  does  not  involve  (3  and  since  B  is 
also  free  of  this  parameter,  the  asymptotic  distribution  of  and  hence 

A 

that  of  0  does  not  involve  the  scale  parameter  p.  This  completes  the 
proof  of  Theorem  2. 


A 

6.  Calculation  of  power  for  the  test  of  normality  based  on  0 

A 


For  studying  the  behavior  of  the  test  of  fit  for  normality  based  on  Q 
we  have  carried  out  power  computations  for  several  alternative  families  of 
distributions.  The  null  hypothesis  has  been  stated  in  (4.1)  and  the  test 
statistic  (5  formulated  in  (4.15). 

The  following  alternative  distributions  A^,  A^,  k^f  A 5  are 
considered. 

Aji  Exponential 


(1) 


(x  |  Y)  =  — 


x  >  v. 


-°°  <  < 


00 


y2>0 


A^:  Double  Exponential 


(x  I  \)  = 


e 


< 


-oo  <  x  <  oo 


<  Yj  <  Y2  >  0 
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A^:  Logistic 


X“Y, 


P^(x  |  v)  = 


X"Y 


1 


-oo  <  x  <  oo 


^2  2 
Y,(l  +  e  r 


-00  <  v  1  <  °o  >  V,  >  0 


A^:  Pearson  Type  III 


X-Yi 


'2  x-y 


(D/v  I  >  _  e 

px  (x  1  vl  -  v,np)  'v 


( — i)p-' 


x  >  Y, 


-00  <  y  <  00,  y  >0,  p  >  0 


A  :  "  Power  Distribution" 
5 


(1),  |  . 
Px  (x  |  y) 


1 

2 


X“Y, 


2 

1+p 


-OO  <  X  <  00 


-°°  <  y ,  <  00 ,  yP  >  o  ,  p  >  -l  . 


All  the  alternatives  A.  -  A  ,  inclusive,  involve  unknown  parameters 

1  5 

Yj  and  y^  which  are  location  and  scale  respectively.  Thus  the  power  will 
be  the  same  for  all  possible  values  of  y^  and  y  according  to  the  result 
proved  in  Theorem  2. 
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The  asymptotic  power  is  given  by 


<“<£  V|!  a  2  *\M) 

1  1  !»ai  2 


(6.1) 


♦  2 

where  the  asymptotic  non-null  distribution  of  0  is  that  of  2j  d.X.  as 

2  1  1,3i 
proved  in  Theorem  1,  and  is  the  100(1— or)  percent  point  of  the 

X*  distribution. 

A  generalization  of  Gurland's  (1955,  1956)  Laguerre  series  expansion 
has  been  given  by  Kotz  et  al  (1967)  for  the  distribution  of  quadratic  forms 
in  non-central  normal  variates.  We  make  use  of  this  expansion  in  order  to 
compute  the  power  given  by  (6.1).  These  calculations  have  been  carried  out 
for  sample  sizes  n  =  50,  75,  100,  and  the  two  levels  of  significance  a  =  .05, 

.  01.  The  results  appear  in  Table  1  for  all  the  alternatives  A  through  A  , 

i  D 

with  several  different  specified  values  of  the  parameter  (3  in  the  case  of  Aa 

4 

and  A  as  indicated  in  the  table. 

5 

A  modified  form  of  the  Pearson  chi-square  test  has  been  considered  by 

2 

Dahiya  and  Gurland  (1970b)  where  the  test  statistic  is  denoted  by  X^.  According 
to  this  modification,  the  estimators  obtained  from  the  ungrouped  sample  are 
utilized  in  determining  the  class  interval  end  points  as  well  as  in  the  test  statistic 


2 

X  .  For  convenience  in  making  some  comparisons  with  the  Q  test  the  values  of 
R 

2 

power  of  the  test  against  the  alternatives  listed  in  Table  1  are  included  for 
.hose  cases  corresponding  to  sample  sizes  n  =  50,  100  which  are  available  from 
Dahiya  and  Gurland  (1970b).  These  values  are  enclosed  in  parentheses  and  are 
based  in  each  case  on  the  number  of  class  intervals  giving  the  maximum  power  in 
Tables  1,  2  of  Dahiya  and  Gurland  (1970b).  For  example,  in  the  case  of  alternative 
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n  =  50,  100  when  the  number  of  class  intervals  is  7,  and  in  the  case  of 


alternative  its  power  attains  maximum  values  .  547,  .  800  corresponding 
to  sample  sizes  50,  100  respectively  based  on  3  class  intervals. 

A 

It  is  evident  on  examining  the  values  of  power  for  the  0  test  in  Table  1 

that  for  most  of  the  cases  considered  there  its  value  is  higher,  and  sometimes 

very  much  higher,  than  the  value  for  the  test. 

As  we  examine  Table  1  in  detail,  we  note  that  for  alternatives  A^  and 

A2,  namely,  the  exponential  and  double  exponential,  the  power  is  rather  high. 

2 

For  the  exponential,  the  power  is  slightly  lower  than  for  that  of  the  ^  test 

whereas  for  the  double  exponential,  the  reverse  is  true.  For  a  logistic  alternative 

the  difference  in  the  power  of  the  two  tests  is  dramatic.  For  example,  when 

2 

n  =  100  the  power  of  the  ^  test  with  optimal  number  of  classes  k  =  3  is  .180 

A 

for  a  =  .  05  whereas  for  the  Q  test  the  corresponding  power  is  .654. 

As  regards  A4,  namely  the  Pearson  Type  III,  it  is  evident  from  the 
table  that  the  power  is  higher  for  low  values  of  the  parameter  p  and  decreases 
slowly  as  p  increases.  The  decrease  in  the  power  is  explained  by  the  fact 
that  the  alternative  tends  to  normal  as  p  becomes  increasingly  large.  As 
evident  from  the  few  values  of  power  of  the  test  appearing  for  alternative  A^ 

A 

it  behaves  similarly  to  the  Q  test  although  its  power  is  substantially  less. 


Alternative  A_  is  considered  in  the  table  with  values  of  p  decreasing 

r> 

from  3.0  to  -.95.  Similar  to  the  behavior  of  the  X^  test,  the  power  increases  as  p 
incr  ,‘ases  for  p  >  0,  and  it  also  increases  as  p  decreases  for  p  <  0,  which  behavior 
is  explained  by  the  fact  that  the  normal  distribution  is  a  special  caseof  the  family  A. 

with  p  =  0.  Tor  all  the  values  of  p  considered  here  except  p  -  -.  so,  the 

-  2 
power  of  0  test  is  obviously  higher  than  that  of  tne  X^  test. 
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TABLE  1 


Power  of  0  test  for  normality 


a  =  .  05  a  =  .  01 


Alternative 

n  =  50 

75 

100 

50 

75 

mHm 

A  = 

Exponential 

.927(1.000) 

.941 

.953 

(1.000) 

.892 

.913 

.930 

V 

Double  Exponential 

.833 

(.547) 

.858 

.879 

(.800) 

.754 

.789 

.818 

Logistic 

.606 

(.  128) 

.631 

.654 

(.  180) 

.465 

.495 

.  523 

V 

with  p  =  .  5 

.966 

.972 

.976 

.949 

.957 

.964 

A4! 

with  p  =  2. 0 

.865 

.898 

.923 

.804 

.849 

.883 

V 

with  p  »  2. 5 

.839 

(.502) 

.879 

.909 

(.864) 

.765 

.820 

.861 

V 

with  p  =3.0 

.814 

(.391) 

.860 

f  ^3 

(.716) 

.732 

.791 

.8  37 

V 

with  p  =  3.  5 

.790 

(.318) 

.841 

.881 

(.597) 

.698 

.  762 

.814 

V 

with  p  =  4.  0 

.767 

(.268) 

.822 

.865 

(.506) 

.667 

.734 

.789 

V 

with  p  =  5.  0 

.722 

(.  205) 

.784 

.834 

(.381) 

.608 

.680 

.741 

A4! 

with  P  =10.0 

.548 

.617 

.678 

.407 

.473 

.  534 

with  p  =  3. 0 

.996 

.996 

.997 

.994 

.995 

.  995 

V 

with  p  =  2.  0 

.974 

.976 

.978 

.960 

.963 

.966 

V 

with  p  =  .  95 

.815 

.842 

.866 

.730 

.  767 

.799 

V 

with  p  =  .  75 

.721 

(.  376) 

.759 

.792 

(.603) 

.604 

.652 

.695 

A  J 

with  p  =  .  50 

.527 

(.211) 

.  571 

.611 

(.  343) 

.  372 

.418 

.  462 

a5= 

with  p  =  .  25 

.  249 

.  271 

.  292 

.  117 

.  1  33 

.  149 

with  p  =  -.  SO 

.036 

(.144) 

.  105 

.216 

(.262) 

.002 

.009 

.  029 

\s 

with  p  =  -.  75 

.154 

.48  3 

.  785 

.008 

.078 

.  280 

Av 

with  p  =  -.  95 

,  328 

(.331) 

.779 

.  969 

(.  583) 

.027 

— 

.  237 

621 

n  =  sample  size 

>*  -  level  of  significance 

A j  corresponds  to  the  Pearson  Type  III  distribution 
A^  corresponds  to  the  "power  distribution" 


k 


and  found  to  have  lower  power  for  the  most  part. 
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